Efficient modularity optimization by 
multistep greedy algorithm and vertex mover refinement 
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Identifying strongly connected substructures in large networks provides insight into their coarse- 
grained organization. Several approaches based on the optimization of a quality function, e.g., the 
modularity, have been proposed. We present here a multistep extension of the greedy algorithm 
(MSG) that allows the merging of more than one pair of communities at each iteration step. The es- 
sential idea is to prevent the premature condensation into few large communities. Upon convergence 
of the MSG a simple refinement procedure called "vertex mover" (VM) is used for reassigning ver- 
tices to neighboring communities to improve the final modularity value. With an appropriate choice 
of the step width, the combined MSG-VM algorithm is able to find solutions of higher modularity 
than those reported previously. The multistep extension does not alter the scaling of computational 
cost of the greedy algorithm. 

PACS numbers: 89.75.Fb,05.10.-a,89.75.Hc 
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I. INTRODUCTION 

The networks under study in natural and social sci- 
ences often show a natural divisibility into smaller mod- 
ules (or communities) originating from an inherent, 
coarse-grained structure. In general, these modules are 
characterized by an abundance of edges connecting the 
vertices within individual communities in comparison to 
the number of edges linking the modules. 

To detect these partitions several algorithm- or score- 
based approaches have been developed and applied. Very 
popular became the approach introduced by Girvan and 
Newman [1] based on the quality function called "modu- 
larity" for partition assessment. This scoring function 
compares the actual fraction of intracommunity edges 
with its expectation in the random case given an iden- 
tical degree distribution. The partition with the highest 
value of the scoring function is then considered to be the 
optimal splitting. The modularity Q is defined (for undi- 
rected networks) as 
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with the weights of all edges linking pairs of vertices 
in community i, di the sum over all degrees of vertices 
in module i, L the total weight of all edges, and Nc the 
number of communities. 

Intrinsically, the modularity based approach does not 
prescribe the usage of a particular optimization proce- 
dure. In practice, a strategy for optimization has to be 
chosen. The modularity optimization is a NP-hard prob- 
lem [2]. Therefore, only an exhaustive search reveals 
the optimal solution for a generic network. This type 
of search is extremely demanding and only in a few cases 
feasible. Thus, many heuristic approaches such as ex- 
tremal optimization [3], simulated annealing [4], and the 
greedy algorithm [5] have been developed, refined, and 



successfully applied. Among the published approaches 
the greedy algorithm is one of the fastest techniques [6]. 
On the other hand, many examples show that the greedy 
algorithm is not capable of finding the solutions with the 
highest modularity value. Furthermore, recent studies 
have provided evidence that modularity [7] and Potts 
model based approaches [8] are endowed with an intrin- 
sic resolution limit (small modules are not detected and 
amalgamated into bigger ones). Thus, each community 
has to be refined by subduing it as a separate network 
to the community detection algorithm. Therefore, a fast 
and accurate optimization technique is necessary. 

In this article, we enhance the greedy algorithm by a 
multistep feature in combination with a local refinement 
procedure. The enhanced algorithm finds partitions with 
higher modularity values than previously reported. This 
paper is organized as follows. In Sec. II we introduce 
both procedures and describe the motivation for their 
construction. In addition, we discuss performance ori- 
ented implementations and estimate their running times. 
Benchmarking results for a set of real- world networks and 
a comparison with other published results are presented 
in Sec. III. The conclusions are in Sec. IV. In this paper, 
all networks are considered as undirected. The extension 
to directed networks is straightforward. 



II. THE ALGORITHM 

A. Multistep Greedy algorithm (MSG) 

The classical greedy algorithm (first application in 
Ref. [5]) joins iteratively the pair of communities that 
improves modularity most in each step. The essential 
idea of the "multistep greedy" algorithm (MSG) is to pro- 
mote the simultaneous merging of several pairs of com- 
munities at each iteration. The pseudocode of the MSG 
algorithm is presented in algorithms 1 and 2, and an illus- 
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if 



then 



Each vertex is a community 
Calculate the modularity change matrix AQ 
Determine the community degrees di 
while pair with AQ^ > exists do 

for all element (i, j, AQij) in AQ matrix, parsed w.r.t. 
decreasing AQ and increasing do 

AQij > in best I values in AQ matrix 
i and j unchanged in iteration 
MergeCommunities(ij) 
end if 
end for 

end while 

Algorithm 1: Flowchart of the MSG algorithm. The 
modularity change is calculated according to Eq. (1). 
Details of the algorithm are given in algorithm 2. 



trative example is given in Fig. 1. The MSG-algorithm 
starts with each vertex separated in its own commu- 
nity. At each iteration the modularity change AQij upon 
merge of each pair of connected communities (i, j) is cal- 
culated (while nonconnected pairs are ignored because 
their merging yields a negative modularity change) . The 
triplets (i, j, AQij) are parsed in the order of decreasing 
AQ-value and increasing community index. Those com- 
munity pairs (i, j) are joined which fulfill the following 
two criteria: 



1. The modularity change AQij is within the I most 
favorable values (levels) and positive. 



2. "Touched-community-exclusion-rule" (TCER): Nei- 
ther module i nor j is present in another pair in- 
ducing a higher modularity change. 



Convergence is reached when all pairwise merges of com- 
munities decrease modularity (by induction one can prove 
that all merges in further iterations would decrease mod- 
ularity). A level encompasses all triplets (i, j, AQij) with 
equal AQij -value and the level parameter I is kept con- 
stant. By construction the level parameter is always 
smaller than the number of edges in the network. 

The multiple levels promote the concurrent formation 
of multiple centers. Simultaneously growing community 
centers hinder the condensation into few large communi- 
ties (few formed communities scrape all vertices as the 
establishment of a new community is too expensive in 
modularity) as observed in the classical greedy algorithm. 
The TCER is a second mean against excessive aggrega- 
tion into few large modules. This rule permits the addi- 
tion of only one community to an existing community per 
algorithm iteration. Furthermore, the TCER guarantees 
that the modularity change upon all performed merges is 
just the sum over the corresponding AQ elements which 
improves efficiency. 



B. Implementation details of MSG 

The key observation for an efficient implementation of 
the MSG is the following: Upon merge of communities 
i and j only those AQ-elements concerning either of the 
two modules have to be recalculated. When the modules 
i and j are joined into a new one called /, the updated 
modularity changes AQ"£ W (module k is connected either 
to community i or j) reads (see Sec. II in Ref. [9] for 
details) 
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^■Qik + A-Qjk h j and k pairwise connected 
AQik — 4tt i and k connected, j and k not 
AQjfc — 2j^t 3 an d k connected, i and k not 

(1) 

with d x the sum over all degrees of vertices in community 
x = i,j and L the total edge weight. 

Further efficiency improvements are gained from an ap- 
propriate choice of data structures. A set (implementa- 
tion taken from the C-\ — \~ STL-library) is a sorted binary 
search tree. In a set individual elements can be found 
or inserted in 0(log(n)) time (n the number of elements) 
and the extremal entries are found in constant time. The 
modularity changes are stored in the AQ matrix imple- 
mented as vector of row structures. The ith row consists 
of a set with elements (j, AQij ) (j a module linked to 
the community i) ordered according to the community 
index j. This data structure obsoletes a separate stor- 
age of the topology information. The extraction of the 
best I modularity changes is handled via the level set. 
For each pair of connected communities i and j the el- 
ement (min{«, j}, max{i, j}, AQy ) is added to the level 
set. The level-set elements are sorted with respect to 
decreasing AQ and increasing index values. The degree 
information is stored in a vector henceforth named d. 
In each iteration a Boolean vector called touched stores 
whether a community has already been modified in the 
same round. To save the time to determine the highest 
index of a present communities, the number of vertices 
(initial length) is chosen as length of the touched vector. 

The implementation details of the MSG algorithm are 
listed in algorithm 2. The calculation of the community 
degrees involves one parse of the edge information. In 
the second parse of the edge information the AQ matrix 
and the level set is filled. The initial modularity change 
AQij upon join of modules (at this stage the vertices) i 
and j is calculated as (see Sec. II in Ref. [9] for details) 
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with / the weight of the edges connecting the vertices i 
and j, d x the degree of vertex x = i,j, and L the total 
edge weight. The modularity value of the initial partition 
is (TV the number of vertices) 
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FIG. 1: Effect of different values of level parameter during first MSG-iteration on example network. 



Each vertex is a community 

Calculate community degrees d and the AQ matrix 

Determine the initial modularity Q <— Qo = — X^i=i Tl? 
level set <— set of AQ elements AQij), sorted with 
respect to decreasing AQ and increasing 
while first element of level set has AQ > do 

touched <— (0, . . . , 0) Boolean, A^-dimensional vector 

(N = No. vertices) 

{touchedi = 1, if module i is modified in iwMe-loop} 
MP <— subset of level-set elements (i,j,AQij) with 
AQij > and AQij among highest I values 
for all elements (i,j,AQij) of MP do 

if (not touchedi) and (not touched]) then 
while parse AQi. and AQj. concurrently do 

!AQik + AQjk i, k and j, k are linked 
AQ ife - ?0 

AQ ki <- AQ lfe 

Update the knei set 

Update the modularity Q 
end while 
Empty AQj. 

Flag touchedi, touched j <— 1 
Update degrees: di <— 
end if 
end for 
end while 



i and k are linked 
j and fc are linked 

<- Q + AQi fc 



+ dj,dj 







Algorithm 2: Performance-oriented implementation of 

MSG algorithm. The vector touched contains the 
information for the touched-community-exclusion-rule 
(TCER). 



The algorithm iteration starts by initializing the touched 
vector. Subsequently, the Level-set is parsed and all ele- 
ments with positive AQ value, whose modularity change 
is among the best I (external level parameter) different 
values, are stored in a set named MP conserving the or- 
der of the level set. In this order the module pairs are 
merged unless one of them was part of a amalgamation in 
the same algorithm iteration. In the merge process, the 



changed AQ matrix elements are calculated as described 
at the beginning of this paragraph. To determine which 
case applies in Eq. (1) the fact that each row of the AQ 
matrix is ordered with respect to the community index 
can be used. More precisely, parse for the merge of mod- 
ules i and j the corresponding rows concurrently. For 
each row define an momentarily considered element p. If 
the community index of pi is equal to the one of pj , the 
first case applies and advance both p's to the next element 
in the corresponding row. If the index k of pi is lower 
than the one of Pj calculate the AQ"£ W element (/ the 
name of the merged community) according to the second 
case and advance (if possible) only pi . If the module in- 
dex of ^ is larger than the one of pj, proceed analogously. 
If one p reaches the end of the row, merge the remaining 
elements of the other row according to the respective rule. 
This procedure will be called "asynchronous parsing" in 
Sec. IIC. It is customary to update each AQ element 
after calculation. To complete the merge process it re- 
mains to update the community degrees and to flag the 
modified communities in the touched vector. 



C. Running time estimation of MSG 

As we adopted the modularity change calculation of 
Clauset et al. (Sec. II in Ref. [9]) we can adopt their 
method of running time estimation as well. First, we ob- 
serve that the update of one element in the AQ matrix 
and the level set costs in the worst case 0(log(A^)) (inser- 
tion in set, each community has at most TV neighbors with 
N the number of vertices) and (9(log(M)) = 0(log(iV)) 
running time (the number of distinct edges M is bounded 
by the square of the number of vertices N 2 ), respectively. 

Merging communities i and j involves an update of 
the AQ matrix and the level set for each element of the 
corresponding rows of the AQ matrix . The calculation 
of each changed value can be achieved in constant time 
as during the asynchronous parsing it is known whether 
the other community is linked as well and all other in- 
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formation (community degrees) is stored in a vector. 
Thus, the total running time contribution of one merging 
event is 0((di + dj)log(N)) with dk the number of edge 
starts/ends on vertices of community k = In the 

worst case all communities are changed in one algorithm 
round. As the sum over all di values is twice the number 
of distinct edges, the contribution of the merging pro- 
cesses in one algorithm round is at most 0(Mlog(TV)). 
The other steps of one algorithm round are less con- 
sumptive: The extraction of pairs belonging to the best 
I levels can be performed in constant time. The same 
is true for the update of the degree information. If D 
is defined as the depth of the dendrogram of communi- 
ties, at most D algorithm rounds have to be performed. 
Thus, the running time expectation for the iterative part 
is 0(DM log(TV)) which is identical to the complexity of 
the classical greedy algorithm [9]. 

The initialization involves the read-in processes of the 
edge information (M constant time operations), the de- 
gree calculation (part of read-in process), the calculation 
of the initial modularity (constant time operation on TV 
elements) and finally the generation of the AQ matrix 
and the level set at costs 0(M log(TV)) (M insertions in 
a set with at most TV or M elements, respectively). In the 
worst case the expected contribution of the initialization 
to the running time is 0(M log(TV)). 

In the precedent paragraphs we have shown that 
the MSG greedy algorithm has the total complexity 
0(DM log(TV)). Among the published strategies for 
modularity optimization the classical greedy algorithm 
[9] is the fastest [6]. As the MSG shares the worst case 
expectation for the running time with the classical greedy 
algorithm, we conclude that the MSG is one of the fastest 
procedures for modularity optimization. 



Therefore, the selection of the optimal intermediate par- 
tition as in the Kernighan-Lin algorithm is not necessary. 

E. VM implementation 

The modularity change AQ upon reassignment of ver- 
tex v from community i to j can be written as 

links(w <-> j) — links(w <-> i) k v [dj — di\ v ) 

with k v the degree of vertex v, dj the sum over the de- 
grees of all vertices in community j, d t \ v = di — k v the 
corresponding degree for community i without vertex v, 
and L the total weight of all edges. 

The most time consuming part of the VM is the cal- 
culation of the modularity changes upon reassignment of 
the vertices. Consequently, Eq. (2) reduces this bottle- 
neck to the calculation of weight of the edges connecting 
the vertex to the neighboring communities. The connec- 
tivity information of vertex v is stored in a sparse vector 
[i.e., a vector of elements (u, w vu ) with u a vertex linked 
to v and w vu the total weight of all edges connecting ver- 
tices u and v]. These rows are stored in a vector and 
form the topology matrix. To determine the total edge 
weight connecting vertex v with community j the vth 
row is parsed and for each entry the weight is added to 
the subtotal edge weight of the corresponding commu- 
nity. To keep access times short a TV-dimensional vector 
(TV the number of vertices) is chosen to store the inter- 
mediate links(w <-> j) results. The optimal reassignment 
partner for vertex v is the community with smallest index 
yielding the maximal modularity improvement. 



D. Vertex mover (VM) 

To further improve modularity by "adjusting" mis- 
placed vertices, a refinement step called "vertex mover" 
(VM) is applied upon convergence of the MSG algorithm. 
In principle, it could also be applied to other modularity 
optimization procedures. In the VM, the list of vertices is 
parsed in the order of increasing degree and vertex index 
(to resolve the degeneracy of multiple vertices with equal 
degree) and every vertex is reassigned to the neighboring 
community with maximal modularity improvement. This 
parsing-and-reassignment procedure is repeated until no 
modularity improvement is observed. 

The VM procedure is similar to the Kernighan-Lin 
algorithm [10] (applied to modularity optimization in 
Ref. [11]). In contrast to the Kernighan-Lin algorithm 
the VM procedure has a perfectly local focus. In other 
words, instead of repetitively searching for the optimal 
vertex to reassign, the VM procedure parses the vertices 
in the aforementioned order and identifies the optimal 
community for the considered vertex. Furthermore, each 
reassignment of the VM approach improves modularity. 



F. Estimation of VM running time 

Calculating the modularity changes upon reassignment 
of one vertex to any neighboring community involves one 
parse of its edge list supplemented with direct memory 
access to determine the community affiliation and some 
constant time operations for the actual modularity calcu- 
lation. Therefore, the running time contribution of one 
vertex is proportional to its degree. One algorithm round 
requires 0{L) = 0(2, di) running time. The estimation 
of the number of needed iterations is not possible as it 
depends on the quality of the MSG result. In all exam- 
ples tested by us the running time of the VM was always 
at least one order of magnitude smaller and less than one 
minute even for the biggest networks under study. 

III. RESULTS 

A. Test set of networks 

For benchmarking algorithms that optimize modular- 
ity the networks commonly used are the collaboration 
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FIG. 2: (Color online) Dependence of MSG modularity value Qmsg(0 (blue), MSG-VM modularity value QmSG-VM W (black) 
on the level parameter I relative to maximal MSG-VM modularity value Q max . The previously published result <2 P ub/Q max 
(dashed green line) is also shown as basis of comparison. The red circles indicate the value of I that yields maximal modularity. 
A significant number of /-values yield higher modularity than the previously published maximal modularity for all but the 
smallest two networks, i.e. Zachary (not shown) and College. In the latter, only / = 1 yields a higher modularity than Q pu b- 



network (coauthorships in cond-mat articles) [12], the 
graph of metabolic reactions in Caenorhabitis elegans 
[13], the email network [14], the network of mutual trust 
(PGP-key signing) [15, 16], the conference graph of col- 
lege football teams [17], the network of jazz groups with 
common musicians [18] and the Zachary karate club ex- 
ample [19]. In addition, we include less frequently used 
examples such as the graph of the metabolic reactions 
in Escherichia coli [20], two different data set describing 
the protein-protein interactions in S. cerevisiae (budding 
yeast) [21, 22] with labels "PPI" and "yeast." To cover lin- 
guistic applications we benchmark the word association 
network [23] and the graph of the co-appearing words 
in publication titles (co) authored by Martin Karplus [24] 
who has the third highest ^.-factor [25] among chemists 
[26]. Further aspects of social webs were incorporated by 
considering the graph of costarring actors in the IMDB 
database [27]. Noticeable, the actor network - being the 



network with the largest number of edges - serves as a 
proof of concept for such big networks being treatable 
as well. From computer science we include the internet 
routing network [28] and the graph of World Wide Web 
pages [29]. With this selection of networks most cur- 
rently known application fields of networks are covered. 
To study the effect of disconnected graphs and weighted 
networks, we consider in both cases the full network as 
well as the largest connected component (suffix "CP") 
and the unweighted variant, respectively. Unless stated 
otherwise the networks are treated unweighted. 



B. Dependence on I and vertex labeling 

It is important to investigate the robustness upon the 
choice of I and to determine the highest modularity val- 
ues achievable with the MSG-VM algorithm. There is a 
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TABLE I: Results on real-world examples. Among all tested level parameters (all positive integers smaller than 5000 or the 
number of edges if smaller) the value l op t yields the highest value of Q for the considered network. Nc is the number of 
communities found. In most cases, a larger number of communities (larger Nc) is identified by the classical greedy than the 
MSG-VM extension because the former partitions the network in few large communities and many small communities with 
less than ten vertices (mostly 2-20 times more small modules identified by greedy than MSG-VM). The MSG-VM approach 
prevents the condensation into few large modules: The three largest modules contain between 1.5 and 4 times less vertices in 
the MSG-VM partition than in the greedy partition (not shown). The running time (on a recent laptop) is reported for a single 
run of the algorithm. The entry "na" indicates that the running time is shorter than 1 s and therefore not displayed. The suffix 
"CP" points out that only the largest connected component (the "central part") was considered. The acronym "PPI" stands for 
"protein-protein interaction." 
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TABLE II: Comparison of maximal value of modularity ob- 
tained by the MSG-VM algorithm Qmf x 3_VM with previously 
published results <3 pu b. The highest published value was ex- 
tracted from the referenced paper ("Source") where it has been 
calculated by the "Method" whose reference is listed in the last 
column. 



minor dependence on the value of I (Fig. 2) which changes 
the MSG-VM modularity by less than 2 % for large net- 
works. Moreover, the maximal modularity is obtained 
with I < 300 for 14 of the 19 networks (Table I). An 
empirical formula for the optimal choice of the level pa- 



rameter will be presented elsewhere. 

Noteworthily, for a labeled graph and a chosen level 
parameter the algorithm is deterministic. To assess the 
contribution of the labeling, the benchmarking procedure 
is performed also on hundred copies of the smallest ten 
networks with permuted vertex labels. This permuta- 
tion leaves the topology invariant, but modifies the order 
in which the community pairs are considered. In com- 
parison to the maximal modularity value found for the 
unscrambled variants a maximal improvement of 0.94 % 
is observed. 



C. Performance and running time 

The modularity values obtained with the MSG-VM 
approach are listed in Table II. For five of the seven 
networks considered here the MSG-VM algorithm finds 
solutions with modularity higher than previously pub- 
lished. Only for the Zachary Karate network the MSG- 
VM procedure yields a smaller modularity value. For the 
jazz network a solution with the identical Q value is ob- 
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tained. For the networks without published modularity 
values we compare the optimal values obtained by the 
MSG-VM algorithm with the classical greedy algorithm 
for modularity optimization as introduced by Newman 
[5] in Table I. We observe that the MSG-VM algorithm 
outperforms the original greedy algorithm significantly. 

The running time estimations in Sees. II C and II F are 
based on a worst case scenario. To investigate the run- 
ning time behavior on real-world examples, we compare 
the running times of the classical greedy variant and the 
MSG-VM algorithm in Table I. These data show that 
given the appropriate level parameter choice the MSG- 
VM algorithm is in almost all cases faster than the clas- 
sical greedy algorithm and, at the same time, reaches a 
higher value of modularity. 

IV. CONCLUSIONS 

To prevent premature condensation into few large com- 
munities the greedy algorithm for modularity optimiza- 
tion has been extended by a procedure for simultaneous 
merging of more than one pair of communities at each 
step. Furthermore, this "multistep" greedy variant has 
been combined with a simple vertex-by-vertex a posteri- 



ori refinement. On seven networks with previously pub- 
lished modularity values the MSG-VM algorithm com- 
bination outperforms all other frequently used, generic 
techniques except for the smallest of the seven examples. 
In addition, a single run of the MSG-VM algorithm re- 
quires similar computer time as the greedy algorithm. In 
most cases less than 10 independent (i.e., embarrassingly 
parallel) runs of MSG-VM are required to obtain a mod- 
ularity within 1 % of the highest value because an empir- 
ical formula has been derived for the appropriate choice 
of the optimal step- width. Therefore, the MSG-VM algo- 
rithm is an efficient tool to find network partitions with 
high modularity [30]. 
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